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ABSTRACT 

A method is presented for estimating the background at a given location on a sky 
map by interpolating the estimated background from a set of concentric annuli which 
surround this location. If the background is nonuniform but smoothly varying, this 
method provides a more accurate (though less precise) estimate than can be obtained 
with a single annulus. Several applications of multi-annulus background estimation are 
discussed, including direct testing for point sources in the presence of a nonuniform 
background, the generation of “surrogate maps” for characterizing false alarm rates, 
and precise testing of the null hypothesis that the background is uniform. 


Subject headings: methods: data analysis — methods: statistical — surveys — stars: 
imaging — ultraviolet: stars 


Introduction 


This study is motivated by the search for rare bright transient point sources in count-limited sky 
maps generated from data taken by the ALEXIS (Array of Low Energy X-ray Imaging Sensors) 
satellite. Each of the six telescopes in the array uses a spherical narrow-band normal-incidence 
multi-layer mirror with a micro-channel plate detector at the prime focus to detect individual EUV 
photons over a wide 33° field (Priedhorsky et al. 1988|; Bloch et al. 1993|). The data are taken 


in a scanning survey mode, with almost half of the sky (the anti-solar hemisphere) covered every 
fifty seconds. Individual photons are time-tagged, and after accounting for spacecraft attitude as a 
function of time ( |Psiaki et al. 1997 ), each photon is identified with a position on the sky. In this 
original form, the “map” is a photon event list, but in practice, these photons are binned into square 
pixels in a Hammer-Aitoff or a quadrilateralized spherical cube projection ( |Greisen &; Calabretta 
1995; Greisen &; Calabretta 1 99t( ) of the sky. For a given candidate point source location, we 
estimate a source strength by counting photons in a small region of the map about the size of the 
telescope point spread function (more accurate estimates can be obtained by convolving the photon 
counts with the point spread function), and comparing that to an estimate of the background. 


If quantitative models are available for all of the various noise sources in the detector and on 
the sky, then ab initio estimates of the background can be made. To be useful, these models must 
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be accurate and reliable, and this is not always possible; for ALEXIS, one of the main reasons for 
going into space was to identify and quantify these various backgrounds for this new multi-layer 
instrument technology. A simple alternative is to estimate the background by looking at count 
rates in the vicinity of the candidate point source ( e.g see Bowyer et al. 1996| ). Annular regions 
are particularly useful, since they are insensitive to linear gradients in the background, but there 
is still an implicit assumption that the background is spatially uniform (or at worst has only linear 
variation) over the area that includes the point source location and the background estimation 
region. 

With the ALEXIS data, the spatial nonuniformity of the background is quite evident (see 
figure |]a), and arises from a number of effects: among these are uneven exposure of the scanning 
telescopes, telescope vignetting, pinhole leaks, detector masks, time-varying high energy penetrating 
particle flux, an anomalous background component which varies with the angle between the look 
direction and the spacecraft velocity vector (Bloch et al. 1994), motion of the moon across the 


field, and possibly even the spatial structure inherent in the diffuse X-ray sky. In this study, we are 
not concerned with nonuniformities arising from source confusion since point sources are relatively 
sparse at the ALEXIS sensitivity. Most of these effects are approximately known, and work is in 
progress to more accurately model them. However, our approach for point source detection is to 
estimate background from the count map itself. 


In this article, we will describe the use of multiple concentric annuli for characterizing a spatially 
nonuniform background. The most direct application is the estimation of background at a candidate 
point source location. The multiple-annulus approach is mathematically equivalent to fitting the 
background with a two-dimensional polynomial of order q = 2 n— 1, where n is the number of annuli. 
Direct fits, even for relatively low order q, can be unwieldy, but by integrating these polynomials 
over concentric annuli, we find that the interpolation procedure can be considerably simplified. 
With two annuli, for example, we effectively fit a cubic polynomial, but we compute only two 
coefficients instead of ten. 


If multiple-annulus estimates of the background can be made more accurate, then a point 
source detection algorithm that is based on these better estimates will be more sensitive to real 
point sources for a fixed rate of false alarms. A low false alarm rate is always desirable, but what is 
particularly important is that the false alarm rate be well calibrated. We show how this can be done 
using multiple-annulus methods to estimate a smooth but nonuniform estimate of the background. 
From this background, one then generates a Monte-Carlo ensemble of “surrogate” maps, and by 
applying the point source detection algorithm to these maps (which have no point sources), one 
can estimate the false alarm rate. Finally, we will show how multiple annuli can also be used to 
characterize the magnitude of the background nonuniformity. In particular, we will describe a test 
of the null hypothesis that the background is uniform. The test, based on counts in concentric 
annuli, is especially sensitive to the nonuniformities that lead to poor estimates of the background 
level at the center of the annuli, yet is completely insensitive to linear gradients which have no 
effect on background estimates at the center of the annuli. 

In Section 2, we will introduce notation, and then derive the linear combination of background 
annuli counts that provides an unbiased estimator for background in a source region. We will 
consider in particular the use of two annuli, the simplest and probably most useful case, as well 
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as the limit of an infinite number of annuli. In Section 3, we will derive the bias and variance 
(a.k.a. accuracy and precision) of multiple annulus estimators, compare the performance of single 
and multiple annulus methods, derive optimal annulus partitions, and suggest heuristic algorithms 
which trade off bias and variance error. In Section 4, we will illustrate the use of multiple annuli 
on two example data sets, one real and one artificial. In that section, we will provide four different 
applications of multiple annuli estimators: estimating background level, detecting point sources, 
generating surrogate maps, and quantifying background nonuniformity. 


2. Background estimation with multiple annuli 


In this section, we will derive expressions for the average background in a source region as a 
linear function of the backgrounds integrated over annular regions surrounding the source. We will 
derive separate expressions for square and circular regions, but both will have the property that 
the coefficients of the linear combination depend only on the geometry of the annuli. 


For a given region S of a sky map, we will use N$ to denote the observed number of counts 
(photons) in the region, and Ms to denote the expected number of counts due only to the smoothly 
varying nonuniform background. These are dimensionless “counts” and should not be confused 
with the count “rate” (photons per unit time). For a given position (x,y) on the sky map, let 
B(x,y) denote the “count density” of the background - expected counts per unit area at that 
position. Then, 

M s = JJ^B(x,y)dxdy. ( 1 ) 

The average count density in a region S is called B$ and is given by 


Ms = ffs B ( x ,y) dx dy 

As ffs dx dy 


( 2 ) 


where As is the area of region S. 


Here, Ms is the “true” background level. If there is only background (no point sources) in 
S, then the actual number of observed counts Ns will be Poisson-distributed with mean Ms- For 
example, if S is the annular region around the candidate point source, then we will usually estimate 
the background level with Bs = Ms/As = Ns/As¬ 


ia what follows, we will assume that the background count density B(x,y) can be accurately 
approximated, over a small region around the potential point source, by an order-q Taylor series 
polynomial^: 

B { x ,y) = c nm xn y m - (3) 

n-\-m<q 


The formalism that we will develop applies to both square and circular annuli. Square annuli 
are more conveniently employed in maps built from square pixels, while circular annuli are more 


1 We recognize that even large order polynomials will not model abrupt discontinuities in the background - these, 
alas, are all too common in actual practice. Potential point sources near background discontinuities must be treated 
with special care. 
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convenient if the data are recorded in a photon event list. We will characterize these annuli in 
terms of moments, Rk, defined as follows: Let r ? ; and r Q denote the inner and outer radii of the 
annulus (a disk is just an annulus with r* = 0). It will turn out that only the even moments are 
important, so we will define 


Rk( r i,r 0 ) 


1 r 2k+2 _ r 2k+2 

' o _ ' i 

k + l r'l - rf 
k 

1 V r 2 ir 2k ~ 2j 

Hlf"/ ' 


(4) 

(5) 


For circular annuli, this corresponds precisely to the 2/c-th moment (r 2k ). For square annuli, there 
is a fc-dependent prefactor which is not important for our purposes. 


2.1. Average counts in a square annulus 


Let S be a square of side 2r {i.e., of “radius” r). Then, from equation (1;) and equation 
we can write the expected number of counts in the square as 


+S = E 


x n dx / y m dy. 


( 6 ) 


n-\-m<q 


Since the square is symmetric in both x and y, it follows that § r _ r dxx n f_ r dyy m is zero unless 
both n and m are even. So we need only perform the sum over even indices; that is: 


A/’s = 


C 2 n, 2 m [ x 2n dx f y 2m dy 

. r, J — T j —V 


2n-\-2m<q 


— ^2, ° 2n < 
2n+2m<q 

E 


2 ? .2n+l 2 r 2m+l 


2m. 


2n + 1 2m + 1 


4C2n,2m 


2n+2m<q 


(2 n + l)(2m + 1) 


^271+2771+2 


(7) 

( 8 ) 
(9) 


For an annulus with inner radius r* and outer radius r Q , the expected number of counts is given by 

^C2n,2m 


+s = E 


(2 n + l)(2?n + 1) 


( ? .2n+2m+2 _ r 2n+2m+2\ 




m 


2n+2m<q 

Following equation (^), we divide by the area of the annulus to get the count density 


Bs= £ 


C2n,2m 


2n-\-2m<q 


(2n+ l)(2m + 1) 


' r 2n+2m+2 _ r 2n+2m+2' 

T 2 — r? 


( 11 ) 


In particular, if we define 




k + l 


(2 j + 1)(2 k - 2 j + 1) 


C2j,2k-2j 


( 12 ) 
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and invoke the definition in equation (||), we can write 

L?/2J 

Bs = X! L kRk- (13) 

k =0 

where L depends only on the underlying polynomial, and depends only on the geometry of 
the annulus. Here [q/2\ is the “floor” of qj 2; that is, the largest integer less than or equal to q/2. 
What is remarkable about this expression is that there are only 1+ [q/2\ variables (the L^s) which 
depend on the (q + l)(g + 2)/2 polynomial coefficients c nm . If our goal is to estimate B$ for some 
area S , then we only need to estimate 1 + Yq/2\ separate coefficients. 


2.2. Average counts in a circular annulus 


It is fairly straightforward to adapt the above derivation to circular annuli. Writing equation (Jlj) 
in polar coordinates, we get 


r 2ir 


Ms = / B(x,y)dxdy = / r dr B(r cos 9, r sin 0) dO 
J S J ri JO 

Expanding B(x,y) as a polynomial of degree q we obtain 


r 2n 


Ms = J2 Cnm/ rr n+m dr cos n 9 sin m 6 d6 

n+m<q r% 


(14) 


(15) 


It is clear that cos n 9 sin m 9 d9 vanishes if either n or m are odd (and that it is strictly positive 
when both n and m are even), so we can rewrite: 


rr r2n 

Ms = y C2n,2m r r 2n+2m dr cos 2n 9 sin 2m 9 d9 
’ Jo Jo 


2n-\-2m<q 


Let us define 


1 

K nm = — / cos 2n 9 sin 2m 9 d9. 
27r Jo 


(16) 


(17) 


One can show that K nm = (2 m — l)!!(2n — l)!!/2 n+m (n + rn)\. where n\\ = n[n — 2)(n — 4) • • • 1, 
but we don’t actually need the closed form. In terms of K nm , we can write 


Ms = y 


C2n,2m{2 7T Knm) ( 2n+2m+2 2n+2m+2 


2n+2m<q 


2 n + 2 m + 2 


(d 


— T; 


Now, define 


so that 


Lk /* ) Mi k—i C2i,2k—2i 
i =0 


L9/2J 


‘■IhI 


r 2k+2 _ r 2k+2 


(18) 


(19) 


( 20 ) 
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and then combining equation (|||) and equation (||) 


B s 


Ms /area (S) 

Lq/2J t 
7T B k 

k =0 


k + 1 


2fc+2 _ 2k+2 

' o _ ' i 

nr? — nr? 


L9/2J 

^ ' LfcRk- 
k =0 


( 21 ) 

( 22 ) 

(23) 


where, again, the 1 + \_q/2 J variables L k depend on the underlying polynomial but not on the 
geometry of the annuli, and the R k depend only on the annuli. 


2.3. Using multiple concentric annuli to estimate source background 


The advantage of the decomposition of B$ into a sum of the form given in equation (|b]), and 
again in equation (p3|), can be seen when we try to estimate the background count density in a 
central region B src from the count densities B s estimated from each of the concentric annuli. 

Assume that the background can be well-modeled (at least in a region containing all the annuli) 
by an order q polynomial, and that we have 1+ [q/2\ concentric annuli surrounding the point source 
candidate. 


Let R ks denote the R k value given in equation ([|) for the s-th annulus. This is essentially the 
2/e-th moment and is a purely geometrical property of the s-th annulus. Let B s be the average 
count density in that annulus. Under these assumptions, the 1 + [q/2 J annuli all satisfy 

L9/2J 

B s = y L k R ks (24) 

fc =0 


which is a linear system of 1+ \_q/2\ equations in 1+ \q/2\ unknowns. In particular, let R sk denote 
the s, k element of the inverse of the R ks matrix. That is, J2k R-skRks 1 = <W. Then L k = J2 S BsR-sk- 
If the central source region has moments R k , s rc , then we can write the count density in the central 
area as 

-®src = ^ ^ L k R h ,src — y, y B s iz sk Rk ,src (25) 

k k s 


or, if we define 

As = TRskRk^ (26) 

k 


then 


B s 


s 


(27) 


is a simple linear combination of the B s ’s, and the coefficients (3 S depend only on the geometry of 
the annuli. In particular, if we estimate B s with B s = N s /A s , then we can estimate B src with 


B' src = J2fcN s /A s , 


(28) 
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where the prime indicates that this estimate of B src is obtained from the surrounding annuli and 
not from the direct N STC /A src . Note that the f3 s ’s are not necessarily positive; thus, it is possible 
for equation (28) to produce a negative estimate of the background B src . 


2.3.1. An example: two concentric annuli 

Consider a source region 5 src surrounded by two concentric annuli: 5i nner and 5 0 uter- See 
figure ||[ If B(x, y) can be accurately modeled as a cubic function of (x, y ) in the region containing 
the annuli, then we can write 

Bsrc — L 0 A L \ r src 

-Sinner — Lq A L ] Hnner (29) 

Souter = Lq A L i r outer 


where Lq and L\ are two scalars which depend on the underlying cubic background function, and 
r sr c, Hnner, and r ou t er are the “average radii” of the center, inner, and outer annuli. This average is 
defined in terms of the second moment for each of the three areas: 


r% = Ri,S = 


1 

2 




(30) 


Here r lt s is the inner radius, and r 0) s is the outer radius, of the annulus S. We can now estimate 
the average count density for the central area as a linear combination of the count densities in each 
of the two surrounding annuli: 


B 


src 


P-B inner + (1 ~ P)B ou ter 


(31) 


with 


P 


r 2 _ 2 
1 outer 1 src 

r 2 — r 2 
' outer ' inner 


(32) 


We remark again that f3 depends only on the geometry of the the background annuli and the 
source region. It does not depend on the underlying background nonuniformity. Thus, having 
set up a geometrical configuration, one needs to compute (3 only once. We also remark that since 
Hnner > r src , we have f3 > 1; this implies that the coefficient of -B oute r is always negative. A “typical” 
value is f3 = 1.5 (and l — (3 = —0.5), which corresponds to the case that both annuli have the same 
area and are much larger than the hole in the inner annulus. 


It is clear from equation ( |29| ) that a plot of B versus r 2 will produce a straight line. This pro¬ 
vides a convenient visualization (as seen in figure ^|) of the “linear extrapolation” of the background 
from the outer and inner rings to the central source region. 


2.3.2. Overdetermined case 

Although we need at least 1 + \q/2\ annuli to fit an order q polynomial, there is no reason not 
to use more than the minimum number of annuli. For the q = 3 case, we can imagine replacing 




the “linear extrapolation” in figure 0 with a linear fit 


be careful about the Poisson bias ( |Wheaton et al. 1995|) 
coefficients (3 S however we like so long as the following conditions are met: 


if we do it this way, however, we have to 
In general, we are free to choose our 


Rsrc,k — 'y ( PsRs,k 


(33) 


for k = 0,..., [q/2\. These conditions ensure that if the background varies as an order-t/ polynomial, 
then equation (|27]) will hold, and we will have an unbiased estimator. In later sections, we will 
discuss different approaches for optimizing the choice of the coefficients /3 S . 

It is useful to consider the limit of an infinite number of annuli, and to take the continuum 
limit. The role of the coefficients /3 S is then played by a function j3{r) where r is the radius of the 
infintesimal annulus.^ The estimator for the average background in the source region is given by 
an integral 

B' s rc = J (3(r)B(r)2irr dr (34) 

but this can be more usefully written as a sum over all the individual counts (photons): 


N 

-^src / v 
71=1 


P(r n 


(35) 


where r n is the distance^ from the candidate point source location to the position of the n-th photon. 
The conditions to fit an order q polynomial are obtained by setting B(r) = r 2k for k = 0,..., |_g/2j, 


and employing equation (34) to obtain 


J (5(r)2Tir 2k+1 dr = R src . k - 


(36) 


Note that in the limit as the radius of the source region goes to zero, we have R src h = <5fc,o- In 
this limit, we are estimating the background density at a point, instead of estimating the average 
background over a region. 


Equations (34,36) are written for circular annuli; essentially the same result can be applied 
for square annuli, but with l 2tt 7 (perimeter of unit circle) replaced by ‘8’ (perimiter of unit-radius 
square). 

The use of a continuum kernel for simultaneous point source detection and background sub¬ 
traction is also possible ( Damiani et al. (1997|) use wavelets for this purpose). The optimal kernel 
can be constructed to match the point spread function of the telescope and at the same time be 
insensitive to polynomial background nonuniformity. 


2 Actually, the /3(r) that we will use is not exactly the continuum analog of /3„; it’s more like f3(r) ~ /3 S /A S . 

3 The distance is Euclidean (r = x 2 3 * * + y 2 ) if we are using circular annuli - which we might as well do, if we have 

distances available as floating point numbers. The distance is given by the “maximum” metric (r = ma x(x,y)) if we 

are using square annuli. 
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3. Error (variance and bias) in background estimation 

In the limit as the background densities B s are precisely known, the estimate of B src given by 
equation ( p7|) will be precise. In practice, however, these background densities are estimated from 
a finite number of counts in each of the annuli. 

If there are N s counts in an area A s in the s-th annulus, then B s = N s /A s is the estimated 
count density. And since these counts arise from a Poisson source, the average squared error in B s 
will be given by 

E 2 s = (( B a - B s ) 2 ) = B s /A s . (37) 

Using this expression, we can write the averaged squared error for our estimate in equation (^) of 
the count density in the source area as 

EL = ((B' SIC - 5 src ) 2 ) = £ P 2 E 2 = £ f3 2 B s /A s (38) 

s s 

It is straightforward to extend this expression to the continuum limit: 

E 2 rc = J /3 2 (r)B(r) 2nr dr. (39) 

where we define B(r) = ^ / 0 27r B(r, 6) dd. 

The above equations express the variance in the estimate of background count density at the 
source location; in general, there will also be a bias. If the background is well fit by a polynomial 
of order q, then the bias is zero. One way to estimate this bias is to assume that the background 
is well-fit by an order q' polynomial, with q' > q. Then the bias in the order-g 7 multi-annulus 
estimator is zero, and the bias in the order-g estimator will be given by the difference between 
these two estimators. In general, using smaller values of q will increase the bias, but at the same 
time reduce the variance. Determining the optimal order q is therefore a trade-off between bias 
and variance. (In our experience, however, the optimal q is never larger than 3.) 


3.1. Uniform background 


For the moment, consider the case that the background is uniform (so that B s = B for all s). 
The bias in this case is zero, and only the variance is relevant. For the single annulus estimator, 
the variance is given by 

Vi = B/A (40) 

where A = A; nner + A outer . For the two annulus estimator, we have 


U 2 


(3 2 B/A- mner + (1 — /3) 2 B/A ou ter — 


P 2 l (l-/3) 2 \ 


B/A 



{a - /?) 2 \ 
a(l — a) I 


(41) 


B/A 


(42) 
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where (3 is given by equation (32), and we have defined 


Qt — dinner/(dinner “I - ^outer)' 


(43) 


It it clear from these expressions that V 2 can never be smaller than Vj ; in fact, from the geometrical 
constraint that the annuli and source region cannot overlap, it is not too hard to show that V 2 /V 1 
is always larger than five.j] 

Note that equation ( |42| ) provides a criterion for choosing an optimal partition of a background 
annulus into inner and outer annuli. When the background is uniform, it is straightforward to show 
that this optimum partition occurs when the areas of the inner and outer annuli are equal. In this 
case, a = 0.5, and V 2 = (l + (2/3 — l) 2 ) V\. 

When the background is uniform, preference clearly goes to the one-annulus estimator, since 
it has substantially smaller variance. If we were to compute the variance for a three-annulus 
estimator which fit an order q = 5 polynomial to the background, we would find an even larger 
variance. On the other hand, using more than two annuli, but sticking to q = 3, will reduce the 
variance somewhat. This is most evident in the continuum limit. 


In the continuum limit, there is a simple criterion for choosing the optimal function (3(r): 
minimize the variance in equation 


while maintaining the constraints in equation (36). Using 
the method of Lagrange multipliers, and some variational calculus, one can show that the optimal 
function is of the form 


(3(r) = 


L?/2J 

E 

k =0 


A hr 


2k 


B(r) 


(44) 


where Xu are constants whose values are determined from the conditions in equation (|36|) for 
k = 0,..., [q/2\. 


As an example, consider the case that B(r) is constant, that q = 3, and that the source region 
and annulus inner radius are both much smaller than the annulus outer radius. In that case, we 
have (3(r) = a + br 2 . The constraints in equation (36) become 



(45) 

(46) 


and from these, we obtain 


P(r) 


4 6 2 

ttR 2 7ri? 4 


(47) 


4 The factor of five is achieved in the limit as the inner radius of the inner annulus goes to zero, and h; nrler 
we have /3 = 1.5, a = 0.5 in this case, and then from equation (1420, V 2 /V 1 = 5. 


= A, 


outer i 














-li¬ 


as the optimal coefficient function. The variance of this optimal continuum estimator is 


r R 


V C = 


/3 2 (r)B(r) 2i rr dr 


= 2 nB 


r 

■ 4 

6 2 1 
-- r 

)o 

ttR 2 

TiR! i 


r dr = 


4B 

nR 2 


We see that this value is four times larger than the single-annulus 
smaller than the equivalent two-annulus variance. (See figure ll|.) 


(48) 

= 4 B/A. (49) 

variance, but twenty percent 


3.2. Nonuniform background 

While it is useful to understand the performance of the various estimators in the specical case 
of uniform background, the whole purpose of these multi-annulus estimators is to improve the 
characterization of nonuniform backgrounds. With nonuniform backgrounds, the bias is in general 
nonzero, and so bias/variance tradeoffs need to be considered. 

For a two annulus estimator, if the background is nonuniform, then B[ nner ^ B 0 uter in general. 
Let 

B — aBi nner -f- (1 cr)-B ou t er (5b) 

be the average background, [] and let 

r -dinner -Bouter 


be a measure of nonuniformity which is restricted to — 1 < 5 < 1. Here, positive 6 corresponds to 
a “peak” at the source location, and negative 5 to a valley. Note that we can express arbitrary 
linear combinations of Bj nner and B ou t er in terms of the average background and this nonuniformity 
measure. 

uB inner + vB outer = (u + v)B + ((1 - a)u - av)6B (52) 



If the background nonuniformity is well modeled by a cubic polynomial, then the bias in the 
two-annulus estimator will be zero; this means that the bias for the one annulus estimator is given 
by the difference between the one and two annulus estimators. 

(Bgrc — B src ) = B — (/3-Binner + (1 — /3)B outer ) 

— (o/ij nner T (1 O::) Bonier j ( 1 Bn tier T (1 /?)B ou te r) 

= (a - P)6B (53) 

We can now write the total relative squared error (variance plus squared bias divided by B 2 ) for 
the one-annulus estimator: 

T 2 = 1 + (a - P) 2 S 2 (54) 


5 B is also the source background that would be estimated by the one-annulus estimator. Recall a was defined in 
equation (1) in terms of the ratio of areas of the inner and outer annuli. 
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where JV = AB is the expected number of counts in the background annulus. 

For the two-annulus estimator, the bias is zero but the variance is given by 

^2 = @ -dinner/dinner T (1 — /3) -Bouter / A 0 uter (55) 

Using equation (]52|) to expand the above sum, we can write an expression for the total relative 
squared error: 


Tl = V 2 /B 2 


i 2 , (i-/?n 


+ 


'f3 2 ( 1 - «) 


q 


(1 - (3) 2 a \ 

1 -a ) 


1 

AT 


(56) 


If |(5| <C 1, we can write more simply 


rp2 

1 2 



(a - /3) 2 \ 
q(l — a) I 


1 

a r 


(57) 


One prefers the two-annulus method if Tf < T 2 , and this happens when 


A fd 2 > 


1 


q(l — a) 


(58) 


The two-annulus estimator is preferred when the bias is large (<5 large) and the variance is small (A f 
large). If the background annuli cover a large area, then both of these conditions are more likely 
to hold. 


When the background is nonuniform, the optimal partition of the background annulus into two 
annuli is no longer into equal areas. In general, local peaks (6 > 0) prefer a larger inner annulus, 
and local valleys (6 < 0) prefer a larger outer annulus. However, the gain for mild nonlinearities is 
not substantial, and since one usually prefers to use the same-sized annuli for the entire map, we 
recommend A outer ~ Ai nner as a general rule of thumb. 


3.3. Combination of one- and two-annulus methods 


In comparing the one- and two-annulus methods, it is useful to recognize that both methods 
provide an estimate of B SIC as a linear combination of -Bi nner and B outer , with the only difference 
being the choice of coefficients: 


H s rc — 7-^inner T (1 7) Uoul er * 


(59) 


Here, 7 = q with q defined in equation (43) produces the one-annulus estimator, and 7 = f3 with /3 
defined in equation (|32|) gives the two-annulus estimator. Rather than try to make a dichotomous 
choice between the one- and two- annulus estimators, we can instead ask about the optimal choice 
of 7 . In principle, it is straightforward to derive the optimum; we have 



7 2 (1 — q) _ (1 -7) 2 q \ ) J_ 

1 — q i f M 


2 X 2 




(60) 
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so for a given A f and 5, we can take a derivative and set it to zero. Using the 6 <C 1 approximation, 
this produces: 

_ a + C/3 , . 

Toptimal i ^ (hi) 

where £ = Af5 2 a(l — a). Note that this expression for the optimal 7 depends on M and 5, which 
will vary over different parts of the map. There are two approaches we can take in this situation. 
One is to develop an “adaptive” formula for 7 that varies with position on the map; its value will 
depend on local properties of the map, which will have to be estimated at each position. A second 
approach is to find a single best 7 for the entire map. 


3.3.1. Adaptive linear combination 


The simplest adaptive linear combination of Bi nner and B outer uses the expression for 7 in 
equation ( | 6 l|) , but this requires an estimate of the nonuniformity 5. A natural estimator (following 
equation @)) takes 5 = (B 0 u t er — B inner )/B, but this will generally overestimate d 2 , which will 
make 7 larger than optimal. Roughly, 


(W 2 > 


S 2 + 


1 

a( 1 — a)N 


(62) 


which suggests using 6 2 = (5) 2 — Q ^ 1 _ 1 Q ^ ; in other words, use equation (61) with 


£ = max 



Mnner-^-outer l^outer 


(B 0 


-Bo 


(A inner + -4-outer )B 



(63) 


3.3.2. Average best linear combination 

Although the adaptive estimate of 7 in the previous section should in principle be the most 
accurate (at least for high enough count densities), we have obtained better estimates using a single 
“average best” 7 . Let B src denote the estimated counts per pixel in the source area using the actual 
counts in the source area: 

Bsrc = (64) 

^■src 

and let B' src be the estimated counts per pixel in the source area as estimated from the counts in 
the background annuli. If there is not a point source at the candidate location, then both estimates 
should roughly agree. Let 

A-B S rc = B s rc - B' rc (65) 

be the extent to which they do not agree. Note that AB src is a difference of two estimators ; it 
is a quantity that can be computed directly from a map without any knowledge of the actual 
background B src . 
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This empirical estimate of estimation error can be combined with equation (^) to produce a 
tool for choosing a single “average best” parameter 7 . Note that 

^B src = B ,src ^T-Binner T (1 7)Bouter^ (66) 

and 


((AB src ) 2 } — ((B src — -B ou ter) 2 ) — ^{(Bsrc ~ B ou ter) (Binner — B ou ter)) + J 2 ((Bi nner — B Q u ter) 2 ) (67) 


and this is a simple quadratic equation whose minimum occurs at 


Toptimal 


((B src Bouter) (Bj] 


-B, 


outer 


)) 


((^i, 


-B, 


outer ^ 


( 68 ) 


Since this expression uses B src which is estimated by counting photons in the source region, this 
estimate of 7 op timai will only be useful in a map that does not have a lot of real sources. If there are a 
few bright sources, these should be masked off in the estimate of 7 op timai; if there are many sources, 
then one must iteratively find, fit, and subtract off the real sources. The difficulties involved in this 
source-confused regime are beyond the scope of this article. 


4. Applications of multiple annuli 

We will illustrate a number of practical uses for multiple annulus background estimation on 
a real ALEXIS data set (figure [la) and on an artificial data set (figure |]b) which has a cubic 
polynomial background and five artifical point sources. Four different applications of multiple 
annulus methods will be provided: estimating background level; detecting point sources; generating 
surrogate maps for estimating false alarm rates; and quantifying background nonuniformity. 


4.1. Estimating background level 


In comparing various geometries of single and multiple annulus estimators, it is useful to have 


an index that measures the accuracy of these estimators directly from the data. In Section 3.3.2, we 
introduced an expression AB src = B src — B' src which describes the difference between two estimates 
of the background in a source region. The first (B src ) is a direct estimate from counts in the source 
region and the second ( B ' src ) is an indirect estimate obtained from annular regions that do not 
include the source region. In the absence of a real point source, we can evaluate the quality of the 
indirect annular estimate by comparing it to the direct estimate. 


Although we can compute AB s 
this as a difference of differences: 


directly from the data, it is useful to note that we can write 


AB src = (B src - B src ) - (B' rc - B src ) 


(69) 


where B src is the true (but unknown) background. This suggests that there are two contri¬ 
butions to the total variance in (AB src ) 2 : a source fluctuation and a background estimation error. 
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The background estimation error itself arises from two causes: the actual bias arising from the 
assumption that the background is a low-order polynomial, and the Poisson fluctuations in the 
counts in the annular regions. Both of these are independent of the source fluctuation error (which 
is due only to the Poisson fluctuation in the counts in the source region), so these two contributions 
to the variance (source fluctuation and background estimation error) are independent, and we can 
write: 

<(A4rc) 2 > = <(4rc - B stc ) 2 ) + <(5' rc - B src ) 2 ). (70) 

with equality holding when (•) denotes a true ensemble average. We can furthermore estimate the 
first term (the source fluctuation error) from the result in equation (fb)j : 


{(-Psrc B SIC ) ) — B src /A src ~ B SIC /A S 


(71) 


This gives an expression for the background estimation error: 

((B' src - B SIC ) 2 } « ((AB SIC ) 2 ) - 4rcMsrc (72) 


Dividing by the source fluctuation error, we can obtain a directly computable dimensionless quantity 
that we call the “empirical background error index”: 

£ = ((A ^ src)2) - 1 (73) 

(B src ) / A src 

which is essentially the ratio of the average background estimation error to the average source 
fluctuation error. A smaller value is better, though there is arguably a point of diminishing returns 
in obtaining a value very much less than one. 

In figure [| (resp. figure ^), we compare the empirical background error index for one- and two- 
annulus estimators as a function of outer annulus diameter for the ALEXIS (resp. artificial cubic 
background) map. For small annuli, the bias is smaller but the variance is larger, and preference goes 
to the one-annulus estimator which minimizes the variance. For larger annuli, the bias is larger 
but the variance is smaller, and preference goes to the two-annulus estimator, which minimizes 
bias at the expense of variance. For intermediate-sized annuli, the optimal estimator is a linear 
combination of the one and two annulus estimator. These are only qualitative trends; quantitative 
values ( e.g optimal linear coefficients, or optimal annulus size) depend on the background. What 
the empirical background error index provides is a way to estimate these quantitative values directly 
from the raw data. 


4.2. Detecting point sources 

Our main motivation in attempting to more accurately estimate the background is that this 
provides a more sensitive point source detections - in particular, it permits us to detect weaker real 
sources without increasing the false detection rate. 

When the background is known exactly, it is straightforward to assess the significance of a 
given point source candidate. If N src counts are observed in the source region, and the background 
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B src is known exactly, then S = N SIC — A src B src estimates the source strength. Under the null 
hypothesis, the quantity N src will be Poission distributed with mean A src B src ; the variance of N src 
(and therefore of S) will also be AsrcBsrc, so the “signal to noise ratio” will be given by 


S/N 


S 

/Va T(S) 


-Urc A sr c B s rc 


\/ /L 


(74) 


Under the null, S/N will have mean zero and variance one|] If the count rate is high, then S/N 
will furthermore be distributed as a Gaussian, and therefore it makes sense to think of the S/N 
as a number of “sigmas of significance” for a given point source detection. For small numbers of 
counts, the Gaussian approximation becomes inaccurate, and the statistic S/N is not sufficient to 
characterize the significance of the point source detection. In that case, however, an exact test, 
using Poisson statistics, is straightforward to derive. Gehrels (1986|) , for instance, provides tables 
and useful approximations for the Poisson formulas that arise in this context, pTpka. Cordes 


fe Wasserman (1994) ) argue for using a histogram to characterize the background; this avoids 
assumptions of Gaussian or even Poisson statistics, but requires that that the background be 
uniform over a large enough region that a good histogram can be acquired. 


But our interest is in the case that the background is not known, and must be estimated by 
counts in surrounding annuli. If the background is estimated by B' rc , then the signal is given by 
S = N src — A src B' src , and the “noise” has an extra contribution due to the variance of the estimator. 
In particular, for the multiple-annulus estimators, we can write 


s = n sic -a sic J2PsN s /a s 

s 


(75) 


and the variance is given by 


Var(S) = A src B src + A 2 src £ f3 2 s B s /A s 

S 


(76) 


where N s (resp. A s ) is the counts (resp. area) in the s’th annulus. A more sensitive statistic would 
employ a matched filter (see Vikhlinin et al. (1995 ) for a more extensive exposition on the use of 
a matched filter for source detection, a flat annulus for background subtraction, and Monte-Carlo 
tests for calibration). 


The more precisely we can estimate the background, the more sensitive is our test for point 
source candidates; in particular, anything that can be done to reduce this variance ( e.g using 
background annuli with large area A s ) will increase the statistical significance of any real sources, 
without altering the false alarm rate. This is a tangible motivation for estimating the background 
as precisely as possible. 


However, exact values of B src and B s are not known in general, so the variance itself must be 
estimated. For the one-annulus estimator, a number of authors have shown how this should be done. 
In particular, Li fe Ma (1983) argue (and demonstrate numerically as well) that one should use the 


®Note that we do not use \/JVsrc in the denominator, as some authors recommend. If we did, then S/N would not 
have mean zero and variance one under the null hypothesis. 
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null hypothesis that the background is uniform, and estimate B = ( N SIC + A r back)/(^4src + Aback)- 
That is, 

Var(S') ~ A src B + Ag rc I?/Aback = (A src /Aback )(As rc T Aback) (77) 


and 


S/N = 


N src - {A src /Aback ) -Aback 


(78) 


src /Aback )(A src T -^back) 

This statistic will have mean zero and variance one as long as the background is flat (B src = B back) 
and there are no point sources. It is only approximately Gaussian, however, and the approximation 
becomes poorer as the number of counts becomes small. Li &; Ma (1983| ) also provide a much more 
complicated estimate of S/N which is more accurately Gaussian with lower counts. This more 


complicated expression can be further extended to account for weighted counts ( Theiler &; Bloch 
1997a|) . The Cash statistic ( Cash 1979 ; Nousek &; Shue 1989|) provides a good tool for parameter 
estimation and confidence intervals even when the number of counts in individual bins is small, but 
it is not really designed for point source detection. Damiani et al. (1997 ) employ a wavelet-based 
point source detector, essentially a continuum circular source kernel and annular background, and 
provide an empirical correction to Gaussian statistics with coefficients determined from a set of 
Monte-Carlo runs. 


In Lampton (1994 ), the essentially binomial character of the counts in the source and back¬ 
ground regions is exploited to produce an exact expression for statistical significance of counts in a 
source and background region. An extension of these exact results to nested annuli is described in 
Theiler & Bloch (1997b). Interestingly, Lampton’s formula is equivalent to one derived earlier in 
Alexandreas et al. (1993|), but the earlier derivation is based on an entirely different approach. 


But all of these exact results and corrections to Gaussian statistics assume a uniform back¬ 
ground. For the nonuniform background, at least for the purposes of this study, we will employ the 
implicit Gaussian assumption in our use of signal to noise (or “sigmas”) to characterize significance. 
However, in the next section, we will describe an empirical approach for calibrating these sigmas 
to actual significance. 


The “signal” and “noise” for the multiple-annulus estimators are given in equation (75) and 
equation (|76|). As with the one-annulus estimator, the signal depends only on the counts in and the 
areas of the source region and background annuli, but the noise depends on the true background 
levels. To keep things simple, we will follow the approach of Li fe Ma (1983|), and use B = 


Atotai/Atotai where IVtotai — -A src -I- A. c N s nud A^otai — A src T Es A s ■ then 


Var(S') — iV tota i(A src /A total ) ( 1 T A src @1 / A s 


(79) 


and we measure significance with 

S/N = 


N src - ^4-src EsPsN s /A s 


\/ -^total {A src A^-total) (IT -^■src E sft/As 


(80) 


In figure [?], we show maps of the significance S/N for the ALEXIS data, as estimated using 
one- and two-annulus methods. A number of bright O and B stars and previously identified bright 
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extreme ultraviolet sources ( Bowyer et al. 1996| ) are readily identifiable, as well as others that may 
be unidentified transient sources or false alarms; a more detailed study of these detections will be 
reported in a later paper. Table [I] shows all sources that are detected at the four sigma level using 
one of the two methods. For those significant at the five sigma level, identifications are provided 
as well. For these identified sources, there is a fairly close correspondence between the significance 
level of the two methods, though the two-annulus detector is usually the more significant. 


In figure |8], we show significance maps for the cubic background map, again using one- and 
two-annulus methods. Point 1, the lower left artificial point source that is at the bottom of the 
trough, is barely detected by the one-annulus method with a significance of just above 3.0. From 
the figure, it also appears that the one-annulus detector exhibits a spatially uneven number of false 
alarms, with most of the false alarms occuring in the upper right quadrant where the background 
has a peak. By contrast, the two-annulus method produces false detections that are spread more 
evenly over the map, and the real source in the lower left quadrant is more readily identified. 


To study this effect more systematically, we repeated this experiment a thousand times, and 
the results are shown in Table The two-annulus detector was slightly better, on average, than the 
one-annulus detector for points 2, 4, and 5; these are points on the diagonal where the background 
curvature is small. The two-annulus detector was substantially better for point 1, which is in the 
the trough. The one-annulus detector reported a larger sigma for point 3, the upper right point 
source, but that is also where the one-annulus detector has its highest false alarm rate. In fact, the 
one-annulus detector reported an average significance (4.2 sigmas) for this point that was larger 
than the nominal significance (4.0 sigmas) that was designed into the experiment. 


4.3. Generating “surrogate maps” 


It is sometimes possible to analytically characterize the false alarm rate (or p- value) for simple 
point source detection algorithms when applied to a single candidate point source; but it is gener¬ 
ally a lot more complicated, and often is outright intractable, to obtain false alarm rates for entire 
maps. In general, one discounts the significance of a candidate point source by the number of inde¬ 
pendent point source detection attempts (or “trials”). But when the source regions and background 
annuli act as “moving windows” over the map, the detection attempts are not independent, and it 
is nontrivial to get the correct trials factor. Saja (1995| ) has discussed the spatial overlapping win¬ 
dow problem, and has computed (by numerical integration) the “completeness” versus “spurious 
detection rate” for a given magnitude limit and signal to noise ratio. Biller et al. (1994 ) look at the 
simpler case of overlapping time windows, and provide an empirical fit to the trials factor (see Table 
3 of that reference). See Biller (1996 ) for a more extensive discussion of trial factors in the context 
of combining statistical tests in an open-ended search. If the background is known precisely, and 
the sources are sufficiently rare, the problem can be treated analytically ( Politzer fc Preskill 1986 ), 
but for our unknown and spatially nonuniform backgrounds, we resort to Monte-Carlo calculation. 


If we have a good estimate of the background at each pixel, we can can make an ensemble of 
“surrogate maps” by assigning each pixel a value chosen from a Poisson distribution with mean 
equal to the background at the pixel. The surrogate maps, by definition, do not have point sources, 
so any point source detections will be spurious. Applying the point source detection algorithm to 
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a large number of such maps, one can estimate of the false alarm rate, and in particular, can find 
an appropriate threshold for the “number of sigmas” of significance a single source must exhibit to 
be significant at a given level for the entire map. 

The most straightforward way to estimate the background at each pixel is to use a one or two 
annulus estimator, as we have discussed in earlier sections. However, this generates a background 
map which is not smoothly varying from pixel to pixel, but instead includes fluctuation that arises 
from the Poisson statistics in the original map. Such surrogates generate more false alarms than 
are consistent with the null hypothesis of a smoothly varying background. 

Our approach for generating surrogate maps is to find a background function B(x, y ) which is as 
smooth as possible while still being consistent with the original map. We can define “smoothness” 
in terms of the two-annulus estimator. A “perfectly smooth” map is one that can be modeled 
exactly as a cubic polynomial; such a map will satisfy 


Bjj — (3Bi nner + (1 — P)B ou ter 


(81) 


where H inner and B outei . are averages of B,j over the inner and outer annuli, and (3 is given by 
equation (32). Exact equality is an unreasonable demand, because that implies a single cubic 


polynomial fit over the entire background. However, we can try to satisfy this equality as closely as 
possible, while constraining the background to be consistent with the data. In particular, we want 


(Nij - Bijf 


B 


1. 


(82) 




To find a Bij which simultaneously satisfies both (approximate) constraints, we take an itera¬ 
tive approach: 


1. On the first iteration, we estimate B^ at each pixel using the “average best” linear combina¬ 
tion of two annuli described in Section 3.3.2| . 

2. On subsequent iterations, we refine this estimate of B^ using the straight two-annulus method 
as defined in equation ([3|) and equation (32). 

3. For each of these subsequent iterations, however, we perturb our estimate by a small fraction 
e so that it does not stray too far from the data. That is, 


B 


(n+l) 


(1 - e) (/3B^ er + (1 - /3)B£L) + eNij. 


(83) 


In our implementation, the e is adjusted in an ad hoc manner until equation (82) is satisfied; 
it is not difficult to imagine ways to automate this selection. 


This background map is computed just once; multiple surrogate maps are generated by replac¬ 
ing each pixel of the background map with an integer chosen randomly from a Poission distribution 
with mean equal to the floating point background value. Figure || shows the results of one thousand 
surrogate maps; for each map, the point source algorithm is applied, and the statistics of point 
source detections (which by definition are all spurious) are maintained in a cumulative histogram. 
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This histogram provides a calibration which tells how many spurious detections are expected in 
the map as a function of “sigmas of significance” for a single detection. The dotted lines in fig¬ 
ure H indicate the required number of sigmas for a single source detection in order to achieve the 
traditional level of p=0.05 significance for the entire map. For the ALEXIS map, this value is 5.0 
sigmas; for the artifical map, it is about 4.6 sigmas. 

From this calibration, we can compare the false alarm rates of the one and two annulus de¬ 
tectors. For the ALEXIS data, as seen in figure |ja, there is little difference between the two. 
There is a discernible difference for the artificial cubic background map; as shown in figure 0a, 
the one-annulus method produces approximately 20% more false alarms. This is not in itself a 
problem for the one-annulus detector, since the important thing is that the false alarm rate can 
be estimated; one simply adjusts the level of S/N that is needed for a confident detection. More 
worrying is that fact that the one-annulus method’s false alarms do not occur uniformly in the 
map; if we restrict our attention to the upper right quadrant (where the background is generally 
peaked), then figure 0b shows that the false alarm rate for the one-annulus method is roughly 
double that of the two-annulus method. 


Since we know the true background for the artificial map, we can use this to asses how well 
the surrogate maps estimate the true false alarm rates. To estimate these true rates, we performed 
a Monte-Carlo experiment with one thousand realizations of the artificial cubic background map, 
but without any point sources added, and applied the one-annulus and two-annulus detectors to 
these realizations. Figure 11 shows the ratio of the false alarm rate estimated from the surrogate 
maps to the true false alarm rate estimated from the Monte-Carlo experiment. This ratio is very 
nearly one (to within estimation error) for the one-annulus detector, and within a few percent of 
one for the two-annulus detector. That the surrogate maps provide a slightly higher false alarm 
rate leads to slightly more conservative point source detections in the real map. 


4.4. Characterizing background nonuniformity 


Although the multiple and continuum annuli estimators can account for smooth (polynomial) 
background nonuniformity when the count density is not too low, there are still times when a single¬ 
annulus estimator is preferred. As we’ve seen, a single-annulus estimator has lower variance than 
a multiple-annulus estimator. Under the assumption of uniform background, one can furthermore 
obtain an exact test of significance ( e.g Lampton 1994| ), even for arbitrarily low counts. But even 
in this single-annulus scenario, it is useful to check the assumption of background uniformity. 


One natural measure, which was originally used on the ALEXIS project, is the Poisson dis¬ 
persion index ( Freund fe Williams 1966| ). If N t is the count in the i-th pixel in the background 
annulus, and there are a total of K pixels, then the statistic 


X = 


Eiii(Ni-n) 2 

n 


(84) 


where n = = N/K is the average counts per pixel, will be approximately chi-square 

distributed with K — 1 degrees of freedom. If X is large {X K — 1), that indicates a pixel-to- 
pixel nonuniformity that is larger than can be accounted for by Poisson variation. 
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This statistic is less than ideal from the point of view of point source detection for several 
reasons. Since there is no spatial information coded into the statistic, it is only sensitive to pixel- 
by-pixel variations. This makes it relatively insensitive to a weak nonuniformity that is spread over 
a large area; this is a problem that is exacerbated when the pixels are small. A related small-pixel 
problem occurs when there are many more pixels than photons (when n is much less than one). In 
this regime, the distribution of X is no longer expected to be chi-square. In fact, in the limit as 
n —> 0, each pixel will contain either zero photons or one, and X = ( K 2 — N 2 )/N regardless of how 
those photons are arranged. 

On the other side of the coin, the Poisson dispersion index is sensitive to all kinds of deviations 
from uniformity, including those, such as a linear gradient, for which a single-annulus estimator is 
quite adequate (is optimal in fact) for estimating the background in the source area. The statis¬ 
tic will (correctly) indicate that the background is nonuniform, but a perfectly good background 
estimate will be needlessly disqualified. 

Using concentric annuli, we will develop a statistic that is both more sensitive to weak quadratic 
nonuniformities and at the same time insensitive to such “benign” nonuniformities as linear gradi¬ 
ents (or any odd polynomial powers).]] We will define a statistic which is a simple linear function 
of the counts in the annuli: 


t = J2vsN s 

s 

where the coefficients Vs are chosen to satisfy the conditions 

(85) 

Y VsA s = 0, 

s 

(86) 

Y r is A s/ Y = L 

(87) 


S S 


This statistic will have mean zero, and variance ( T 2 ) = N, where N = )T( S N s is the total number 
of counts. In particular, we will be able to reject the null hypothesis of uniform background with 
T/\fN “sigmas” of significance. If there are two annuli, then there is only one solution for r] s : 
Vinner = \/(l — a)/a, and r/ out e r = \Ja/(l — a). For more than two annuli, we note, without going 
into detail, that it is possible to choose the coefficients Vs so that the statistic is optimally sensitive 
to quadratic nonuniformities - these are the nonuniformities that lead to the most serious mis¬ 
estimates of background when the single annulus estimator is used (a continuum-limit statistic, in 
which the coefficients Vs are replaced by a function r/(r), is also straightforward to derive). Figure [l2| 
compares the Poisson dispersion index and a two-annulus statistic, showing that the two-annulus 
statistic is completely insensitive to to linear gradients, but is more sensitive to weak quadradic 
nonuniformities. 

The last two columns of Table ||] show how the annulus-based background statistic is able 
distinguish troughs (negative value at point 1), peaks (positive value at point 3), and low curvature 


7 Actually, it is not a bad idea to directly employ the Poisson dispersion index in equation (|8^), but with annular 
regions in place of individual pixels. If the regions have different areas, then a slight modification is needed, but 
such a statistic is by design insensitve to linear gradients, and since annular regions are generally much larger than 
individual pixels, one expects the chi-squared approximation to be more accurate. 
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points (2, 4, and 5) in the background nonuniformity. As with ordinary point source detection, 
there is a trade in the choice of annulus size. Choosing an outer annulus equal in size to the annulus 
that is used for point source detection provides an appropriately local measure of nonuniformity; 
choosing a larger annulus size however provides a more sensitive measure. 


In the special case that there are two background annuli, then the distribution of (-/Vm ner , -/V outer ) 
is binomial and an exact p-value can be computed in terms of the incomplete beta function. This 
was pointed out by Lampton (1994| ) in the context of determining whether the counts in a source 
region are consistent with the counts in a single background annulus. In particular, a one-sided 
p-value is given by the probability of seeing IVinner or more counts in the inner annulus. 


P = 



a n ( 1 - a) N ~ n = I a (N innei , N outei + 1) 


( 88 ) 


where N = Ni nner +N outeT , and a = Ai nner /(Ai nner + A ou t er ), and I a is the incomplete beta function. 
The one-sided p-value will be small when IVinner is unusually large; this occurs when the background 
is “peaked” near the candidate point source location, the very nonuniformity that is most likely to 
lead to a spurious point source detection. 

We remark that this statistic does not “assume” that the background is smoothly varying, or 
well-modeled by a polynomial. It is simply a test of the null hypothesis that the background is 
uniform. It will detect many different kinds of deviations from uniformity, but is designed to be 
particularly powerful against case that the background has a smooth convexity (or “peakiness”). 


5. Conclusion 


We have derived linear multiple-annulus estimators that provide unbiased background estima¬ 
tion when the background is spatially nonuniform but smoothly varying. The approach works for 
both circular and square background annuli, and can be conveniently applied to photon event lists 
as well as to maps of the sky with photons binned into square pixels. One trades accuracy for pre¬ 
cision (lower bias for increased variance) in going to a multiple-annulus estimator, and if the count 
rate is low or if the background is nearly uniform, the trade becomes unfavorable. Fortunately, one 
does not have to depend on general trends or asymptotic results to inform this trade; the empirical 
background error index, described in Section 4.1, provides a simple figure of merit that can be 


computed directly from the data. As well as comparing distinct background estimation methods, 
it can be used to define an optimal annulus size and/or an optimal linear combination of the one 
and two annulus estimators. 


We have seen that multiple-annulus background estimators can, in some cases at least (e.g., 
the cubic background map), provide more robust point source detection. But if the count density 
in the map is not very high, the increased variance of the multiple-annulus estimators may lead 
to a preference for simple one-annulus point source detection. But even in this regime, multiple- 
annulus methods can play a supporting role. We have shown how a multiple-annulus smoothness 
condition can be used to generate nonuniform “surrogate maps” which are can be used for assessing 
false detection rates and significance thresholds. We have also shown how to use multiple annuli 
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to characterize the nonuniformity of the background; potential point sources that are detected at 
places on the sky map with a high degree of nonuniformity will be more suspicious than those that 
are detected in flatter regions. 

At this writing, the ALEXIS satellite is still flying, and is still taking data. The ALEXIS daily 
point source detection effort is described by [Roussel-Dupre et al. (1996|) . As the mission nears 
completion, work is in progress on a final point source catalog. 
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Fig. 1.— (a) This data set constitutes one face, centered at 90° Right Ascension, and 0° Decli¬ 
nation, of the quadrilateralized spherical cube projection of a full-sky map generated by co-adding 
49 months of raw count data from one of the six ALEXIS telescopes. The telescope we chose (IB), 
has a high signal to noise ratio compared to the other telescopes, and detects extreme ultraviolet 
photons in a narrow band centered at 172A. The brightest object in this view is Sirius B, the white 
dwarf companion of the visibly brightest star in the sky; it is in the lower left quadrant, but is 
not easy to see before subtracting the background. The contour lines indicate the irregular, but 
relatively smooth, spatial nonuniformity of the background; most of this nonuniformity is due to 
the uneven exposure of the scanning telescope. To avoid edge effects, the 360x360 map has been 
extended by 50 pixels in each direction by including pixels from the four adjacent faces, and the 
corners have been filled in by reflecting pixels across the edges. The padded pixels are used only 
for background annuli; the point sources are only detected in the original quad-cube face, (b) An 
artificial data set was generated to have an exactly cubic polynomial background. Five artificial 
point sources were added at locations on the map indicated by the enclosing squares. All five have 
a source strength that would be significant at 4 sigmas, if the background were precisely known. 
The points are arranged like the five dots on a die, and each “point” is in fact a 3x3 source region 
with weight 0.2 in the center and 0.1 at each of the eight surrounding pixels. In this 200x200 map, 
the outer 30 pixels are reserved for “padding;” points are detected only in the interior. 
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Fig. 2 .— The one-annulus method estimates background in the source region (src) with the observed 
count density in the background annulus (back). Using two annuli, the count density in the inner 
and outer annuli are extrapolated to the center to provide an estimate of the average background 
in the source region. 



0 

Distance from point source, R 2 


Fig. 3.- The background in the source region (src) is estimated from the counts per pixel in 
the inner and outer annuli. The two-annulus estimator (open square symbol) provides a direct 
extrapolation of the outer and inner count densities to the source region. The one-annulus estimator 
(open circle symbol) treats the inner and outer annuli as a single annulus, and so computes an 
average of the inner and outer background estimates weighted by their respective areas. The error 
bars indicate plus and minus one-sigma errors (where “sigma” is the square root of the variance 
of the estimator) for each of the estimated background levels. The two-annulus estimator has a 
much larger variance than the one-annulus estimate, but it is unbiased even for cubic polynomial 
nonuniformities in the background. 
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Fig. 4.— Radial kernels for background estimation at the origin: (a) Simple one-annulus esti¬ 
mator; if the background is uniform, then this provides the optimal estimate, (b) Two-annulus 
estimator; if the background has nonuniformities which can be modelled with a cubic polynomial, 
then this estimator is unbiased, (c) The continuum limit of a multi-annulus estimator is a kernel 
function which varies with radius. One can derive an optimal kernel function - see equation ( |47|) 
- which produces the least estimator variance while being constrained to being unbiased for cubic 
nonuniformities. 
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Fig. 5.— The Empirical Background Error Index for the ALEXIS map shown in figure |lja is 
plotted against outer annulus diameter for the simple one-annulus estimator (circles), for the two- 
annulus estimator in equation ( |3l| ) (squares), for the “average best” linear combination of one- 
and two- annulus estimators (dotted line, with diamonds). A smaller number indicates a more 
accurate background estimation. The annuli were square with odd-integer diameters; the source 
region was 3x3 pixels, and the hole was 5x5. For the two annulus estimators, the annulus was 
partitioned to provide approximately equal areas for the inner and outer annulus. The dotted line 
in the upper panel shows the coefficient of the best linear combination, rescaled so that a value of 
zero corresponds to the one-annulus estimator, and a value of one to the two-annulus estimator. 
For small annuli, the one-annulus estimator is preferred, but for larger annuli, the two-annulus 
estimator is superior. As the annulus diameter increases further, however, the approximation of 
the background as a cubic polynomial surface breaks down, and the two-annulus estimator gets 
worse. At their best annular diameters, the one-annulus and the two-annulus estimators exhibit 
virtually identical performance for this data set. 
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Outer Diameter 


Fig. 6.— The Empirical Background Error Index for the artifical cubic background map shown in 
figure |l]b is plotted in just the same way as shown in figure (5| Again, the one-annulus estimator is 
preferred if the annulus is small, but the two-annulus estimator is better as the annulus size grows. 
In this example, since the background is precisely cubic over the entire map, the index decreases 
essentially monotonically with increasing annulus size. 
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Fig. 7. — Significance map for the ALEXIS map shown in figure [l]a, using (a) a one-annulus detector 
with a 13x13 pixel background, and (b) a two-annulus detector with a 25x25 inner annulus and a 
35x35 outer annulus. In both cases, the source region was 3x3 and the annular hole was 5x5. The 
significance is in units of “sigmas”, or more specifically, the S/N statistic of equation (|80|). Only 
significances above three sigmas are shown, and point source detections with S/N > 4 are indicated 
with squares. (Those with S/N > 5 are also listed in Table 1.) A number of known white dwarfs 
and bright O and B starts are visible in this map, including the distinctive belt of Orion, which is 
near the center of this map. 




Fig. 8.— Significance map for the artificially generated cubic background map shown in figure jljb, 
using (a) a one-annulus detector with a 13x13 pixel background, and (b) a two-annulus detector 
with a 29x29 inner annulus, and 41x41 outer annulus. In both cases, the source region was 3x3 
and the annular hole was 5x5. Point 1 (lower left) is seen to be more significant in the two-annulus 
map than in the one-annulus map. Also, there is a preponderance of false alarms in the upper right 
quadrant of the one-annulus map. 
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Sigmas of significance Sigmas of significance 


Fig. 9.— Cumulative histograms generated from one thousand surrogate maps of both (a) the 
ALEXIS map (figure jl|a) and (b) the artificial cubic background map (figure |l]b), using both 
one-annulus and two-annulus point-source detection strategies (annulus sizes are described in the 
captions to figure [?] and figure ||). The horizontal axis is significance S/N, and the vertical axis is 
the average number of spurious sources per map which are significant at that level or greater. The 
dotted line corresponds to the traditional p=0.05 level of significance. 
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Fig. 10.— (a) The ratio of false alarm rates for the one- and two- annulus detectors is shown for 
the artificial cubic map (see figure Pa), (b) We restrict attention to the upper-right quadrant of 
the cubic map; in this quadrant the background is peak-like, and we see that the false alarm rate 
is roughly twice as large with the one-annulus detector compared to the two-annulus detector. 
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Fig. 11.— The false alarm rate estimated from the surrogate maps is divided by the “true” 
false alarm rate estimated from a Monte-Carlo run with 1000 realizations of the artificial cubic- 
background map (with no artificially added point sources). This ratio is plotted against significance 
threshold for (a) the one-annulus detector and (b) the two-annulus detector. 
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Fig. 12.— Results of a Monte-Carlo experiment comparing the sensitivity of the Poisson Dis¬ 
persion Index (PDI) (dotted line) in equation (84) with an annulus-based statistic (solid line) in 
equation (^), over a range of nonuniform backgrounds. The background is generated by a quadratic 
equation: Bij = c + bi + a(i 2 + j 2 ), where i,j are the pixel indices with values i = 0, j = 0 corre¬ 
sponding to the center pixel of a 13x13 map. For each trial, a Poisson realization was constructed 
for the given background, and the two-annulus statistic (using a 9x9 inner annulus) and the PDI 
was computed. Both statistics have mean zero and variance one when the background is flat, and 
so both can be used as a number of sigmas of signficance. Plotted is the mean (with error bars 
indicating the standard deviation of the mean) value of each statisic for 400 trials. In (a), the 
slope b is varied while keeping the quadratic coefficient a = 0, and in (b) the quadratic coefficient 
a is varied while keeping the slope b = 0. In both cases c = 100 counts. We see from (a) that 
the annulus-based statistic is completely insensitive to simple linear gradients, whereas the PDI is 
sensitive even to gradients of order one percent per pixel. (We do not mean to advocate the PDI 
as a sensitive detector of gradients; the high sensitivity in this case is a function of the relatively 
large counts per pixel and the large number of pixels in the background annulus.) By contrast, we 
see from (b) that the annulus-based statistic is more sensitive to small quadratic nonuniformities. 
Note that a negative coefficient a corresponds to a “peak” in nonuniformity, and these peaks in the 
background are just the artifact that are most likely to lead to spurious source detections. Positive 
values of a (“valleys” in the background) lead to negative significance. If a two-tailed test of flatness 
is desired, then the absolute value of the statistic should be used; this is shown as a dashed line in 
this plot. 






Table 1. Point source detections in the ALEXIS map with S/N>5 


ID 

Coordinates 

RA Dec 

Significance S/N 
Two-annulus One-annulus 

Sirius B 

101.11 

-16.84 

38.63 

37.65 

RE J0457—280 

74.24 

-28.25 

24.26 

23.75 

WD 0549+158 

88.09 

15.96 

17.16 

16.17 

RE J0515+324 

78.74 

32.72 

10.17 

10.14 

e CMa a 

104.45 

-29.05 

9.53 

8.74 

C Pup a 

120.68 

-39.96 

9.17 

9.34 

RE J0723—274 

110.73 

-27.80 

8.52 

8.55 

( Ori a 

85.13 

-1.83 

8.30 

7.69 

7 Ori a 

81.20 

6.42 

7.25 

7.05 

RE 0623-374 

95.85 

-37.65 

6.20 

6.55 

p CMa a 

95.51 

-18.14 

6.15 

5.88 

e Ori a 

84.08 

-1.19 

6.13 

5.52 

p Ori a 

78.47 

-8.30 

5.51 

5.08 

V471 Tau 

57.39 

17.39 

5.11 

5.12 


a These sources are bright O and B stars, and are likely to have 
been detected because of known UV leaks in the detector filters, not 
because of radiation in the ALEXIS bandpass. 
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Table 2. Significance of artificial points in cubic background map 


Significance S/N ac Background nonuniformity bc 

Point Two-annulus One-annulus Large Annuli Small Annuli 


1 

3.55 

3.11 

-49.67 

-1.50 

2 

3.82 

3.77 

0.01 

-0.06 

3 

3.98 

4.20 

20.68 

0.54 

4 

3.83 

3.77 

0.02 

-0.00 

5 

3.84 

3.70 

0.03 

0.08 


a If the background were known exactly, the nominal significance for 
these artificial points would be S/N = 4.0. 


b This statistic is defined in Section 4.4. The large annuli are 29x29 
and 41x41: the small annuli are 9x9 and 13x13. 


c The statistical error for all the numbers reported in this table is ±0.03. 







